# Vision-Language-Action Flow
Flower Libero 10
MIT
FlowerVLA is a pre-trained vision-language-action flow policy model for robotic manipulation tasks, trained on the LIBERO 10 dataset with only 1 billion parameters.
Multimodal Fusion English
F
mbreuss
14
2
Flower Calvin D
MIT
FlowerVLA is a vision-language-action flow model pre-trained on the CALVIN D dataset, employing an efficient flow-matching architecture that achieves general-purpose robot operation strategies with only about 1 billion parameters.
Multimodal Fusion English
F
mbreuss
16
3
Flower Calvin Abcd
MIT
FlowerVLA is a robot operation model pre-trained on the CALVIN ABCD dataset, employing an innovative vision-language-action flow strategy with only 1 billion parameters, specifically designed for robot learning.
Multimodal Fusion English
F
mbreuss
24
3
Pi0
Apache-2.0
Pi0 is a general robot control model based on vision-language-action flow, supporting robot control tasks.
Multimodal Fusion
P
lerobot
11.84k
230
Featured Recommended AI Models